AD-A190 641


DOT/FAA/AM-88/1 

Office of Aviation Medicine
Washington,  D.C.  20591 




An  Evaluation  of  the  Effects  of 
High  Visual  Taskload  on  the 
Separate  Behaviors  Involved  in 
Complex  Monitoring  Performance 


Richard  I.  Thackray 
R. Mark Touchstone


Civil  Aeromedical  Institute 
Federal  Aviation  Administration 
Oklahoma City, OK 73125




January  1988 


This  document  is  available  to  the  public 
through  the  National  Technical  Information 
Service,  Springfield,  Virginia  22161 




U.S. Department of Transportation

Federal  Aviation  Administration 




Technical Report Documentation Page

1. Report No.
DOT/FAA/AM-88/1

3. Recipient's Catalog No.

4. Title and Subtitle
AN EVALUATION OF THE EFFECTS OF HIGH VISUAL
TASKLOAD ON THE SEPARATE BEHAVIORS INVOLVED
IN COMPLEX MONITORING PERFORMANCE

5. Report Date
January 1988

6. Performing Organization Code

7. Author(s)
Richard I. Thackray and R. Mark Touchstone

8. Performing Organization Report No.

9. Performing Organization Name and Address
FAA Civil Aeromedical Institute
P.O. Box 25082
Oklahoma City, Oklahoma 73125

10. Work Unit No. (TRAIS)

11. Contract or Grant No.

12. Sponsoring Agency Name and Address
Office of Aviation Medicine
Federal Aviation Administration
800 Independence Avenue, S.W.
Washington, D.C. 20591

13. Type of Report and Period Covered

14. Sponsoring Agency Code

15. Supplementary Notes
This work was performed under tasks AM-C-S8-PSY-104 and AM-D-87-PSY-104.

16. Abstract

Operational monitoring situations, in contrast to typical laboratory
vigilance tasks, generally involve more than just stimulus detection and
recognition. They frequently involve complex multidimensional
discriminations, interpretations of significance, decisions as to
appropriate action, implementation of actions, and evaluation of
consequences. A simulated air traffic control (ATC) task was developed to
study the effects of prolonged monitoring on a number of such behaviors
embedded in the context of the task. All subjects performed the task under
relatively high visual taskload conditions for a single 120-min session.
Two types of critical events requiring different levels of information
processing for detection were employed. One type of event consisted of a
readily detectable loss of altitude information in an alphanumeric data
block; a second type of event involved the detection of two aircraft at the
same altitude on the same flight path. This latter event required
continuous, successive comparisons of data blocks in order to be detected.
Following detection, a decision was made as to whether or not the situation
might result in a potential conflict (collision). Measures derived from the
implementation of each type of decision enabled acquisition of data on
short-term memory, decision time and decision errors, procedural errors,
and speed of motor movement. The results revealed that time to detect
aircraft at the same altitude increased significantly over the monitoring
period, as did omission errors for this type of event. Detection time for
the more readily detectable alphanumeric changes involving loss of altitude
information showed no evidence of impairment, nor was any impairment found
for any of the other task behaviors that were measured. The findings are
discussed with reference to previous studies suggesting that complex
monitoring primarily affects attentional processes and that the rate of
decline in attention appears to be related to the degree of information
processing required for event detection.


17. Key Words
Air traffic control, attention, automation, monitoring, performance,
vigilance

18. Distribution Statement
Document is available to the public through the National Technical
Information Service, Springfield, Virginia 22161

19. Security Classif. (of this report)
Unclassified

20. Security Classif. (of this page)
Unclassified

21. No. of Pages
13

22. Price

Form DOT F 1700.7 (8-72)    Reproduction of completed page authorized


NOTICE

This document is disseminated under the sponsorship of the Department of
Transportation in the interest of information exchange. The United States
Government assumes no liability for its contents or use thereof.



AN EVALUATION OF THE EFFECTS OF HIGH VISUAL TASKLOAD ON THE
SEPARATE BEHAVIORS INVOLVED IN COMPLEX MONITORING PERFORMANCE


1.  Introduction 

It is increasingly recognized that modern operational vigilance tasks,
such as those related to air traffic control, nuclear control room
operation, security-surveillance systems, etc., involve more than simply
detecting and responding to infrequent critical events. They frequently
involve complex multidimensional discriminations in which stimulus
detection or identification may be followed by interpretation of
significance, decisions as to appropriate action, implementation of
actions, and evaluation of consequences (Craig 1984, Mackie 1984). Yet
traditional vigilance studies seldom look at behaviors other than those
directly related to stimulus detection. This would appear to be true not
only for laboratory studies using simple vigilance tasks, but for studies
of complex monitoring performance as well (see Davis and Parasuraman 1982,
Parasuraman 1986 for recent reviews).

In an effort to examine the effects of prolonged monitoring on behaviors
other than just stimulus detection, we have developed a laboratory
simulation of an air traffic control (ATC) task that incorporates many of
the aspects of real-life monitoring situations. As it is currently
configured, the task simulates an intermediate level of ATC automation in
which the computer acts as an aid to the controller in resolving aircraft
conflict situations. Although monitoring for infrequent event detection
constitutes the principal task requirement, the task was developed to
enable acquisition of data on short-term memory, decision making,
procedural errors, and speed of motor movement.

Our initial study with this task examined the relationship of both visual
taskload and target difficulty to detection performance (Thackray and
Touchstone 1985). Subjects monitored either 8 or 16 alphanumeric targets
in order to detect critical events requiring different levels of
information processing for detection. One type of event consisted of a
readily discernible change in the contents of an alphanumeric data block;
a second type of critical event involved the detection of two aircraft at
the same altitude on the same flight path. This latter event required
continuous, successive comparisons of data blocks in order to detect its
occurrence. While the more readily detectable events showed no evidence
of performance decline at either level of visual taskload, the more
difficult-to-detect altitude events showed evidence of impairment that was
significantly related to taskload; the number of such events not detected
increased significantly under the higher, but not under the lower,
taskload condition. Fatigue, resulting from the effort required to
continuously scan and process information from a large number of targets,
was offered as a possible explanation for this impairment. This
explanation was supported by the finding of a significant decline in
critical flicker frequency (CFF) that occurred under the 16-target, but
not the 8-target, condition.

Because elements of the task just described were still being developed at
the time the above study was conducted, only data relating to detection
efficiency (time and errors) were analyzed in that study. The present
study represents an extension of this earlier one and was conducted to
determine whether the apparent fatigue resulting from prolonged monitoring
under high taskload conditions affects only attentional processes or
whether other behaviors relevant to complex monitoring show impairment as
well. Effective allocation of function in increasingly automated systems
requires information on how prolonged monitoring may affect all
performance aspects of such tasks, not just those related to attention.

The present study also sought to provide further information on the visual
behavior of subjects during times when critical events are missed.
Findings obtained in several of our previous studies suggest that critical
events (e.g., altitude changes) are either missed (Thackray and Touchstone
1985) or are responded to with excessively long detection times (Thackray
and Touchstone 1989) in spite of the fact that subjects appear to be
scanning the display throughout the session. In the current study,
videotaped recordings of eye movement activity and facial orientation were
obtained in order to assess visual behavior of subjects during those times
when missed events occurred.


2.  Methods 

2.1 Subjects. Forty-eight men and women, all paid university students,
volunteered to participate in the study. Subjects ranged in age from 18
to 29 years, had 20/20 uncorrected vision, were nonsmokers, and had no
prior experience with the task used or previous ATC training. None was
currently taking any prescription medication on a regular basis.

2.2 Apparatus and Task Design. The basic experimental equipment consisted
of a Digital Equipment Corporation (DEC) VS11 19-in (49-cm) graphics
display, keyboard, and joystick, all of which were interfaced with a VAX
11/730 computer (DEC). The computer was used both to generate input to
the display and to process subject responses. The VS11 was incorporated
into a console designed to closely resemble an ATC radar unit. Two
diagonal, nonintersecting flight paths were located on the display, along
which aircraft targets could move in either direction. A given aircraft's
location was displayed as a small "blip" on the flight path, and an
adjacent alphanumeric data block identified the aircraft and gave its
altitude and groundspeed. Aircraft position and any change in
alphanumerics were updated every 8 sec. Figure 1 shows a typical target
pattern as displayed to the subject, with the total console-display
configuration shown in Figure 2.

The subject's task was to continually monitor the display for one of two
types of change in the alphanumeric data blocks. The duration of each
type of change (referred to as a critical event) was 90 sec; if a subject
failed to detect a critical event within this 90-sec period, the data
block containing the change reverted to its previous state.

The first type of critical event was readily detectable and consisted of
three X's in place of the three altitude numbers in a given data block.
Subjects were told that this replacement of an altitude value signified
that a transponder malfunction had occurred, resulting in a loss of
altitude information. Upon detection of such an event, subjects were told
to press a designated button on the console, move a joystick-controlled
cursor over the data block containing the critical event, and press
another button on the joystick control unit. This last response
"corrected" the malfunction by replacing the three X's with the previous
altitude value. The second type of critical event was more difficult to
detect, since it was not immediately apparent. This event was the
occurrence of two aircraft at the same altitude on the same flight path.
As soon as such an event was noted, subjects pressed a second console
button. It was next determined whether the two aircraft were moving
towards each other, away from each other, or in the same direction. On
the basis of this determination, subjects then pressed either a "Conflict"
button (indicating that the aircraft were moving toward each other) or a
"No Conflict" button (indicating that the aircraft were either moving away
from each other or were moving in the same direction). In order to
prevent overlapping data blocks, all aircraft in this study were assigned
a constant speed of 450 knots. Thus, only targets moving towards each
other would constitute a potential conflict situation. Following a
"conflict" decision, the cursor was positioned over one of the two
conflicting aircraft, and the joystick control button was pressed. This
caused the computer to assign a new altitude value to one of the two
conflicting aircraft and display this value, along with the aircraft's
identification, in a box at the lower left of the screen. Subjects then
verified that the computer-assigned altitude did not result in a conflict
with some other aircraft on the flight path. If no new conflict was
created, a keyboard entry was made that assigned the new altitude value to
one of the two previously conflicting aircraft. (Although subjects were
led to believe that a computer-assigned altitude might occasionally
result in a conflict with some other aircraft, in actuality this never
occurred.)
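
The conflict/no conflict rule described above can be restated as a short Python sketch. This is illustrative only and is not the software used in the study; the `Aircraft` fields, the `classify_pair` name, and the signed-heading convention are assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Aircraft:
    ident: str        # hypothetical aircraft identifier
    position: float   # distance along the flight path (arbitrary units)
    heading: int      # +1 or -1: direction of travel along the path
    altitude: int     # three-digit altitude value from the data block


def classify_pair(a: Aircraft, b: Aircraft) -> str:
    """Classify two aircraft on the same flight path.

    Assumes equal, constant speed for all aircraft (450 knots in the study),
    so aircraft moving in the same direction never close on each other.
    """
    if a.altitude != b.altitude:
        return "no event"
    if a.heading == b.heading:
        return "no conflict"          # same direction, spacing is constant
    # Opposite headings: the pair is closing only if the aircraft that is
    # farther back along the path is the one travelling forward.
    rear = a if a.position < b.position else b
    return "conflict" if rear.heading == +1 else "no conflict"
```

Under the constant-speed assumption, only the opposite-heading, closing case yields "conflict", which matches the instruction given to subjects.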

Whenever a "no conflict" response was made, no further action ensued,
since no change in altitude was required. Subjects were told that the
altitude of one of the two nonconflicting aircraft would eventually change
to some other value (this change always occurred 60 sec after the no
conflict response was made) and that they had to remember that they had
responded to this particular pair of aircraft. If they failed to remember
and responded a second time, a memory error was recorded.
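
The memory-error rule can be sketched as follows. This is a minimal illustration, not the study's scoring program; the input format (time-ordered `(time_sec, pair_id)` tuples for "no conflict" responses) and the function name are assumptions.

```python
def count_memory_errors(responses, window_sec=60.0):
    """Count repeated responses to an aircraft pair already answered.

    responses: time-ordered list of (time_sec, pair_id) tuples, one per
    "no conflict" detection-and-decision response (hypothetical format).
    A repeat within window_sec of the first response to that pair is
    scored as a memory error.
    """
    first_response = {}
    errors = 0
    for time_sec, pair_id in responses:
        seen_at = first_response.get(pair_id)
        if seen_at is not None and time_sec - seen_at <= window_sec:
            errors += 1                      # subject responded to the same pair again
        else:
            first_response[pair_id] = time_sec
    return errors
```

The 60-sec default mirrors the interval after which the altitude of one of the two aircraft changed.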

The number of targets on each flight path was kept equal at all times; as
one left the screen, another appeared. Nine critical events occurred in
each 30-min period, with no more than one event present at any given time.
Of these nine events, three were XXX's, three were conflicting altitude
changes, and three were nonconflicting changes. These events were
arranged in a quasi-random order with the restriction that each of the
three types of events had to occur at least once in both the first and
second 15 min of each 30-min period. Subjects were given no information
regarding the frequency of events or their order of occurrence. The times
between events (interstimulus intervals) ranged from 126 to 302 sec with a
mean of 200 sec.
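
One way to produce a schedule satisfying these restrictions is simple rejection sampling, sketched below in Python. The report does not state how its schedules were generated. The uniform draw of intervals, the onset-to-onset reading of "time between events", and all names here are assumptions, and the sketch does not enforce the 200-sec mean.

```python
import random
from itertools import accumulate

PERIOD_SEC = 30 * 60          # one 30-min period
EVENT_SEC = 90                # each critical event lasts 90 sec
EVENT_TYPES = ["XXX", "conflict", "no_conflict"]


def is_valid(onsets, labels):
    """Check the restrictions stated in the text for one 30-min period."""
    if onsets[-1] + EVENT_SEC > PERIOD_SEC:
        return False                                  # last event must fit in the period
    half = PERIOD_SEC / 2
    for kind in EVENT_TYPES:
        times = [t for t, k in zip(onsets, labels) if k == kind]
        if not any(t < half for t in times) or not any(t >= half for t in times):
            return False                              # type missing from one 15-min half
    return True


def draw_period(rng):
    """Redraw order and intervals until every restriction is met."""
    while True:
        labels = EVENT_TYPES * 3                      # three events of each type
        rng.shuffle(labels)
        gaps = [rng.randint(126, 302) for _ in labels]  # assumed uniform intervals
        onsets = list(accumulate(gaps))
        if is_valid(onsets, labels):
            return list(zip(onsets, labels))
```

Because every interval is at least 126 sec and each event lasts 90 sec, no two events can be present at the same time in a schedule drawn this way.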

2.3 Video Recording Methodology.

A miniature Sony CCD TV camera was mounted in the lower left corner of the
console at an approximate 46 degree angle to the subject's face. The
output of this camera was combined, by means of a special effects
generator, with the output of a second camera located to the rear of the
subject that was used to record the contents of the simulated radar
display. The combined outputs of both cameras were displayed on a video
monitor. A small indicator light, not visible to the subject, was located
above the console and was momentarily illuminated each time a critical
event occurred. Continuous videotape recordings enabled subsequent
playback and analysis of the subject's visual behavior during times when
critical events were not detected.

2.4  Procedure 

On arrival, subjects were played a tape recording that stated that this
experiment was part of a series of studies designed to investigate the
role of the controller in increasingly automated ATC systems. They were
told that the task was designed to simulate an intermediate level of ATC
automation in which computer aids are used to assist the controller. They
were then given task instructions and separate practice in responding to
each kind of critical event.

In order to add a greater element of realism to the task, a tape recording
of background noises recorded in actual air traffic control radar rooms
was played continuously during the 2-hour task session. Sound level of
this noise at the subject's head location was 62 dBA. It was not expected
that this would have any effect on performance, since an earlier study
using a previous version of this monitoring task failed to find any
significant performance effects of this noise at a considerably higher (80
dBA) level (Thackray 1982). At the completion of the 2-hour task period,
subjects were given a thorough debriefing concerning the purposes of the
experiment.


3.  Results 

3.1 Target Detection Time and Errors of Omission.

As described earlier, subjects monitored the display for the occurrence of
either one of two types of events. The first type of event, signifying an
altitude malfunction, consisted of an XXX that replaced the three-digit
altitude value in an alphanumeric data block; the second type of event,
constituting a potential conflict or no conflict situation, could only be
detected through continuous comparisons of each target's altitude with the
altitude values of all other targets on a given flight path.

Figure 3 shows mean detection times across 30-min periods for both types
of event. Separate repeated measures analyses of variance (ANOVAs)
applied to these data revealed no significant change across the 2-hour
session in detection time for altitude malfunction events (F(3/141)=1.68,
p>.05), but a significant increase in time to detect possible conflict/no
conflict situations (F(3/141)=15.47, p<.001).
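
The degrees of freedom reported above are what a one-way repeated measures design with 48 subjects and four 30-min periods would yield. The short Python check below illustrates that arithmetic; the function name is an assumption, and the sketch does not reproduce the ANOVA itself.

```python
def repeated_measures_df(n_subjects, n_levels):
    """Degrees of freedom for a one-way repeated measures ANOVA.

    Effect df = levels - 1; error df = (subjects - 1) * (levels - 1).
    """
    df_effect = n_levels - 1
    df_error = (n_subjects - 1) * (n_levels - 1)
    return df_effect, df_error


# 48 subjects measured in four 30-min periods
df_effect, df_error = repeated_measures_df(n_subjects=48, n_levels=4)
```

With these inputs the function returns 3 and 141, matching the F(3/141) notation used in the text.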

With regard to errors of omission, the more readily detectable malfunction
events were never missed by any of the subjects. For aircraft at the same
altitude, however, 71% of all subjects missed at least one of these
occurrences during the two-hour session. Since the actual proportion of
events missed relative to events presented was rather small, it was
decided to compare omission rate during the first and second hours of task
performance rather than during separate 30-min periods. Combining across
subjects and events revealed that 21 of the conflict/no conflict events
were missed during the first hour and 77 during the second, yielding miss
rates of 4% and 13% respectively. A Wilcoxon comparison of the first and
second hours revealed the increase in miss rate to be significant (p<.05).
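
The reported miss rates can be checked against the design described in the Methods: 48 subjects, each presented with six same-altitude events (three conflict, three no conflict) per 30-min period. The Python sketch below is an illustrative check only; the variable and function names are assumptions.

```python
SUBJECTS = 48
ALTITUDE_EVENTS_PER_30_MIN = 6     # three conflict + three no conflict events
PERIODS_PER_HOUR = 2

# Same-altitude events presented per hour, summed over all subjects
events_per_hour = SUBJECTS * ALTITUDE_EVENTS_PER_30_MIN * PERIODS_PER_HOUR


def miss_rate_pct(missed, presented):
    """Percentage of presented events that were missed."""
    return 100.0 * missed / presented


first_hour = miss_rate_pct(21, events_per_hour)    # about 3.6%
second_hour = miss_rate_pct(77, events_per_hour)   # about 13.4%
```

Rounded to whole percentages, these values give the 4% and 13% stated in the text, and the two counts sum to the 98 missed events analyzed in Section 3.6.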


FIGURE  3.  MEAN  DETECTION  TIMES  ACROSS  30-MIN  PERIODS  FOR  BOTH  LEVELS  OF 
EVENT  DIFFICULTY. 


3.2 Decision Time and Decision Errors.

Following a subject's response to the detection of two aircraft at the
same altitude, a decision was made as to whether the situation represented
a potential conflict or a no conflict situation. The time from detection
response to decision response was obtained for each altitude event for
each subject, with means displayed in Table 1. Also shown in Table 1 are
data for a second measure of decision time. This measure consisted of the
time between a subject's interrogation of the computer for its suggested
resolution to a conflict and acceptance of this resolution. Separate
ANOVAs performed on the two sets of data shown in Table 1 revealed no
evidence of any increase or decrease in conflict/no conflict decision time
across the 2-hour session (F(3/141)<1.00), nor any evidence of a
significant change in acceptance time for computer-generated altitude
resolutions (F(3/141)=2.29, p>.05).

TABLE 1. MEAN TIMES (IN SEC) FOR SEVERAL MEASURES OF DECISION
BEHAVIOR DURING THE TWO-HOUR SESSION.

                                     Thirty-minute Periods
Measure                             1       2       3       4

Conf/No Conf Decision Time        6.29    6.77    6.12    8.11
Time to Accept Alt Resolution     4.36    3.79    3.95    3.87

Decision errors were recorded whenever a conflict decision was made to a
no conflict situation or a no conflict decision to a conflict situation.
If the incorrect decision was then followed by a sequence of behaviors
appropriate to the decision made, this would suggest an incorrect
interpretation of the altitude event; if the incorrect decision was
followed by a sequence of behaviors that would have been appropriate to
the opposite decision, one could infer that the subject had made a
careless error in not pressing the button intended. Only 3 errors of the
latter type were documented, suggesting that carelessness was not a
significant factor in incorrect decisions. With respect to the former
type of error, 14 were made during the first hour and 8 during the second,
yielding error rates of 2% and 1% respectively. The Wilcoxon comparison
of first and second hours was nonsignificant (p>.05).

3.3 Motor Movement Time.

In order to obtain an indication of possible change in the speed of motor
activity with time on the task, measures were obtained that reflected the
time taken by subjects to move the joystick-controlled cursor from the
bottom of the screen and locate it over the data block containing a
critical event. Two similar but separate measures of such behavior were
obtained: those associated with correcting malfunction events and those
associated with resolving altitude conflicts. Mean times for each measure
are shown in Table 2. Separate ANOVAs yielded no evidence of a
significant change in time to complete either of these two movement
sequences during the 2-hour session (F(3/141)<1.00 in both cases).

TABLE 2. MEAN CURSOR MOVEMENT TIMES (IN SEC) ASSOCIATED WITH
RESOLVING MALFUNCTION EVENTS AND ALTITUDE CONFLICTS.

                                     Thirty-minute Periods
Measure                             1       2       3       4

Movement Times for
Malfunction Events                6.88    7.22    6.47    6.45
Movement Times for
Altitude Conflict Events          6.60    6.59    7.25    6.66



3.4  Memory  Errors. 


Whenever a no conflict decision response was made to two aircraft at the
same altitude on the same flight path, the altitudes of these two aircraft
remained the same for a 60-sec period following the decision response.
During this time period, if a subject failed to remember having previously
responded to these two aircraft and made a second detection and decision
response, a memory error was recorded. The frequency with which such
errors occurred was found to be quite small. During the first hour of the
session, 4% of the no conflict situations were responded to twice, while
during the second hour, the error rate declined to 3%. A Wilcoxon test
revealed this decrease to be nonsignificant (p>.05).

3.5 Procedural Errors.

As described previously, detection responses to both malfunction and
altitude conflict events were always followed by a sequence of behaviors
that served to resolve the particular event. Whenever any element of
these behavioral sequences was performed out of order or omitted, or an
incorrect element was added to the sequence, a procedural error was
recorded. Such errors, like the memory errors above, occurred quite
infrequently, with an error rate of only 2% during the first hour and 4%
during the second. A Wilcoxon test performed on these data revealed the
increase in errors from the first to the second hour to be nonsignificant
(p>.05).

3.6 Videotape Analysis of Omission Errors.

Videotaped recordings of each subject's visual behavior during the session
were examined, specifically with regard to visual activity during times
when altitude events were not detected. Thus, for each missed conflict/no
conflict event, visual activity was examined over the 90-sec period that
the event was present on the screen. Because of problems with the video
recorder, and because the subject's seating position at times prevented a
complete analysis of facial orientation and visual activity over the
entire 90-sec period, not all missed events could be analyzed. Of the 98
events missed by the subjects, there were 40 events for which visual
activity data were available during all of the 90-sec scoring period. As
indicated earlier, the intent of this analysis was not to provide precise
information on fixation times, fixation points, or scanning patterns, but
rather simply to gain information on general visual activity during times
when subjects failed to detect aircraft targets at the same altitude.
From preliminary viewing of the tapes, it was determined that any portion
of the scoring period could be categorized in one of three ways: (1) Eyes
open, head oriented toward screen, continuous scanning; (2) Eyes closed;
(3) Eyes diverted from screen.

The above categories, while admittedly rather qualitative, served the
purpose for which they were intended. This was to ascertain the extent to
which the increase in frequency of missed events that occurred during
monitoring could be attributed to subjects failing to detect these events
simply because their eyes were either closed or diverted away from the
display. Analyses of the tapes revealed that 97% of the scorable missed
events occurred during periods in which subjects had their eyes open and
were actively scanning the display. One event was missed because a
subject's eyes were diverted from the display, but no missed events could
be attributed to a subject's eyes being closed during the time the event
was present.


4.  Discussion 

Detection times for the alphanumeric change used to indicate an altitude
malfunction showed no evidence of any increase over the 2-hour session.
Mean detection time averaged 9.2 sec, and these events were never missed
by subjects. The time required to detect aircraft at the same altitude,
however, increased significantly over the session, from an average of 19.6
sec during the first half hour to 28.8 sec during the final half-hour
period. In addition to the increase in detection time, the frequency with
which such events completely escaped detection by subjects also increased
significantly. Four percent were missed during the first hour and 13%
during the second. Taken together, these findings are consistent with
those obtained previously using this task under comparable taskload
conditions (Thackray and Touchstone 1985).

Although the ability to detect aircraft at the same altitude showed clear
evidence of impairment over the 2-hour session, the processes contributing
to this impairment are not immediately apparent. Clearly, the ability to
detect such events involves more than just attention; memory and scanning
would also appear to be important components. Yet with regard to the role
of memory as a contributor to this decline, it should be noted that none
of the other functions or subtask elements involving memory that were
measured in the present study showed any evidence of decline during
monitoring. Thus, neither failures to remember having responded to a
particular no conflict altitude event nor failures to remember correct
procedural sequences increased in frequency during the session. In like
manner, although only a gross assessment of scanning activity was possible
from the videotaped recordings of visual activity, there were no obvious
indications that scanning was not taking place during times when
behavioral evidence (missed events) might suggest inattentiveness.
Further, the fact that detection times for the readily perceivable
malfunction events showed no change across the session suggests that
decreased scanning activity per se was not responsible for the decline in
ability to detect aircraft at the same altitude. One is left to conclude,
then, that the decrement associated with these events would appear to be
specific to attention. A similar conclusion was also reached by Johnston
et al. (1966) in an earlier study of complex monitoring. Performance
decrement under high taskload conditions was found to result primarily
from an increase in lapses of attention, the magnitude of which did not
appear to be uniquely affected by differences in memory requirements of
the task conditions employed.

Memory was not the only aspect of performance that failed to change during
monitoring. There was also no evidence of change in measures of decision
time, decision errors, or motor movement time. These findings are
difficult to evaluate because, as noted earlier, studies of complex
monitoring seldom report on behaviors apart from those directly related to
stimulus detection. However, a few comparisons can be made. In an early
study by Adams et al. (1961), an air traffic surveillance task was used
to study the effect of prolonged monitoring on decision making, in
addition to the usual measures of target detection. Half of the subjects
made only a simple detection response to an alphanumeric symbol change,
while the remaining half were required, following detection, to make a
four-choice evaluation indicating the nature and location of the change
that had occurred. Over a 3-hour monitoring session, performance declined
in the simple detection condition, but showed no evidence of decline in
the condition in which decisions were required. These findings suggest
that the decision requirements, rather than adding to performance
decrement, appeared to have prevented it.

With regard to motor movement time, a subsequent study by Adams et al.
(1962) again used an air traffic surveillance task to examine the effect
of nine consecutive daily monitoring sessions, each 3 hours long, on
detection time and on the movement time required to complete the detection
response. This latter measure consisted of the time between the initial
detection response and response to a second button on a panel 16 inches
away. Although movement time did slow significantly within each session,
the actual magnitude of this slowing was remarkably small, amounting to
approximately 50 msec.

The  findings  of  the  present  study  that  performance  decline  under  high 
(16-target) taskload conditions was confined to attentional behavior, and
within that realm only to the more difficult task of detecting two
aircraft at the same altitude, would appear to support conclusions reached
by Davis and Parasuraman (1982) that information processing demands placed
on the observer may be one of the more significant determinants of
performance decline in monitoring tasks. In order to examine this
possibility within the context of our previous research, a post hoc
comparison was made of the present findings with those of two of our
earlier studies. All studies were equivalent in terms of the number of
alphanumeric targets employed, critical event rates, and task durations.
The principal difference between studies was in the type of critical
events used. In the earliest of these studies (Thackray et al. 1979),
the critical event consisted of the replacement of an aircraft's normal
altitude value with the number "999." This critical stimulus, much like
the malfunction events of the present study, was a readily apparent
stimulus change requiring minimal information processing for its
detection. In a subsequent study (Thackray 1982), critical stimuli
consisted of a change in an aircraft's displayed altitude to a value that
either exceeded an upper limit or was below a lower one. Like the "999"
used in the earlier study, such changes could also be detected without
reference or comparison to any other information displayed on the screen.
Information processing requirements in the later study, however, would
seem to be greater since altitude changes became signals not because they
assumed some fixed numerical value, but because they were detected as
having a value that exceeded previously specified upper or lower limits.

Mean detection times obtained in these two previous studies, along with
data for the conflict/no conflict altitude events of the present study, are
shown in Figure 4. Examination of this figure suggests that an increase
in the level of information processing required for critical event
detection not only increases average detection time, but appears also to
influence the decrement function. An ANOVA performed on the data of the
three studies supported these impressions by revealing a significant
effect for processing level (F(2/101)=120.21, p<.001) and a significant
level by periods interaction (F(6/303)=4.85, p<.001). Since the analyses
conducted in all three of these studies found a significant main effect
for periods, it is not surprising that it was also significant in this
analysis (F(3/303)=13.35, p<.001).
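The degrees of freedom reported for this analysis can be cross-checked with
simple arithmetic. The Python sketch below assumes a mixed design with three
independent groups (one per study) and four measurement periods per subject.
The function name, the four periods, and the total of 104 subjects are
illustrative inferences from the reported degrees of freedom; none of them
is stated in the text.

```python
def mixed_anova_df(n_subjects, n_groups, n_periods):
    """Degrees of freedom for a groups x periods mixed-design ANOVA.

    Groups is treated as a between-subjects factor and periods as a
    repeated (within-subjects) factor.
    """
    df_group = n_groups - 1                       # between-subjects effect
    df_between_error = n_subjects - n_groups      # subjects within groups
    df_period = n_periods - 1                     # within-subjects effect
    df_interaction = df_group * df_period         # group x period
    df_within_error = df_between_error * df_period
    return {
        "level": (df_group, df_between_error),
        "periods": (df_period, df_within_error),
        "level_x_periods": (df_interaction, df_within_error),
    }


# Assumed values (inferred, not reported): 3 studies, 4 periods, 104 subjects.
df = mixed_anova_df(n_subjects=104, n_groups=3, n_periods=4)
for effect, (numerator, denominator) in df.items():
    print(f"{effect}: F({numerator}/{denominator})")
```

Under these assumptions the sketch yields 2 and 101, 3 and 303, and 6 and 303
degrees of freedom, which agree with the three F ratios quoted above.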


FIGURE  4.  COMPARISON  OF  DETECTION  TIMES  FOR  ALTITUDE  EVENTS  DIFFERING 
IN  INFORMATION  PROCESSING  REQUIREMENTS. 


In  our  previous  study  comparing  monitoring  performance  under  8-  and 
16-target conditions (Thackray and Touchstone 1985), it was hypothesized
that the requirement to passively monitor large numbers of targets over a
prolonged period of time demands considerable effort, and that the greater
decrement in performance found under the higher taskload condition was a
reflection of the fatigue resulting from this effort. The results of the
present study suggest that such fatigue effects are confined primarily to
attentional processes; of the other behaviors that were measured (decision
making, short-term memory, ability to correctly carry out procedural
sequences, motor movement), none showed any increase in impairment over
the 2-hour session. Further, the present study, in agreement with our
earlier one (Thackray and Touchstone 1985), found that it was not
detection of events that are readily apparent to the observer that showed
evidence of decline under high taskload conditions. Rather, it was
detection of those events that require considerable information processing
in order to be "seen" by the observer that were most adversely affected by
prolonged monitoring under these conditions. Data presented in Figure 4
suggest that information processing demands required for target detection
may interact with visual taskload to influence the rate of attentional
decline under conditions involving extensive scanning of multiple targets.
Because this interpretation is based on a post hoc comparison of the
findings of several different studies, additional research to examine the
effect of declining attention on detection of targets differing
systematically in processing requirements and presented under different
levels of visual taskload is required before more definitive statements
can be made. Hopefully, such research will enable us to specify more
precisely the kinds of stimulus events that would benefit most from
computer-aided detection, especially with the higher ratios of aircraft to
controllers that are anticipated under the more automated ATC systems
being contemplated (Swedish 1983).




References 




ADAMS, J. A., HUMES, J. M., and STENSON, H. H., 1962, Monitoring of
complex visual displays: III. Effects of repeated sessions on human
vigilance. Human Factors, 4, 149-158.

ADAMS, J. A., STENSON, H. H., and HUMES, J. M., 1961, Monitoring of
complex visual displays: II. Effects of visual load and response
complexity on human vigilance. Human Factors, 3, 213-221.

CRAIG, A., 1984, Human engineering: The control of vigilance. In
Sustained Attention in Human Performance (Edited by J. S. WARM)
(New York: WILEY), pp. 247-291.

DAVIS, D. R., and PARASURAMAN, R., 1982, The Psychology of Vigilance
(New York: ACADEMIC PRESS).

JOHNSTON, W. A., HOWELL, W. C., and GOLDSTEIN, I. L., 1966, Human
vigilance as a function of signal frequency and stimulus density.
Journal of Experimental Psychology, 72, 736-743.

MACKIE, R. R., 1984, Research relevance and the information glut. In
Human Factors Review: 1984 (Edited by F. A. MUCKLER) (Santa
Monica, California: THE HUMAN FACTORS SOCIETY, INC.), pp. 1-11.

PARASURAMAN, R., 1988, Vigilance, monitoring, and search. In Handbook of
Perception and Human Performance (Edited by K. R. BOFF, L.
KAUFMAN, and J. P. THOMAS) (New York: WILEY).

SWEDISH, W. J., 1983, Evolution of advanced ATC automation functions.
Mitre Working Paper Report 83W149, The Mitre Corporation, McLean,
Virginia.

THACKRAY, R. I., 1982, Some effects of noise on monitoring performance and
physiological response. Academic Psychology Bulletin, 4, 73-81.

THACKRAY, R. I., BAILEY, J. P., and TOUCHSTONE, R. M., 1979, The effect
of increased monitoring load on vigilance performance using a
simulated radar display. Ergonomics, 22, 529-539.

THACKRAY, R. I., and TOUCHSTONE, R. M., 1980, Visual search performance
during simulated radar observation with and without a sweepline.
Aviation, Space, and Environmental Medicine, 51, 361-366.

THACKRAY, R. I., and TOUCHSTONE, R. M., 1985, The effect of visual
taskload on critical flicker frequency (CFF) change during
performance of a complex monitoring task. FAA Office of Aviation
Medicine Report No. AM-85-13, 1985.


